NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

GenoMiX: Accelerated Simultaneous Analysis of Human Genomics, Microbiome Metagenomics, and Viral Sequences

https://doi.org/10.1109/BioCAS58349.2023.10388531

Zhang, Tianqi; González, Antonio; Moshiri, Niema; Knight, Rob; Rosing, Tajana (October 2023, IEEE)

Full Text Available
RAPIDx: High-performance ReRAM Processing in-Memory Accelerator for Sequence Alignment

https://doi.org/10.1109/TCAD.2023.3239537

Xu, Weihong; Gupta, Saransh; Moshiri, Niema; Rosing, Tajana (January 2023, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems)

Full Text Available
Accelerating open modification spectral library searching on tensor core in high-dimensional space

https://doi.org/10.1093/bioinformatics/btad404

Kang, Jaeyoung; Xu, Weihong; Bittremieux, Wout; Moshiri, Niema; Rosing, Tajana; Kelso, ed., Janet (June 2023, Bioinformatics)

Abstract MotivationDriven by technological advances, the throughput and cost of mass spectrometry (MS) proteomics experiments have improved by orders of magnitude in recent decades. Spectral library searching is a common approach to annotating experimental mass spectra by matching them against large libraries of reference spectra corresponding to known peptides. An important disadvantage, however, is that only peptides included in the spectral library can be found, whereas novel peptides, such as those with unexpected post-translational modifications (PTMs), will remain unknown. Open modification searching (OMS) is an increasingly popular approach to annotate modified peptides based on partial matches against their unmodified counterparts. Unfortunately, this leads to very large search spaces and excessive runtimes, which is especially problematic considering the continuously increasing sizes of MS proteomics datasets. ResultsWe propose an OMS algorithm, called HOMS-TC, that fully exploits parallelism in the entire pipeline of spectral library searching. We designed a new highly parallel encoding method based on the principle of hyperdimensional computing to encode mass spectral data to hypervectors while minimizing information loss. This process can be easily parallelized since each dimension is calculated independently. HOMS-TC processes two stages of existing cascade search in parallel and selects the most similar spectra while considering PTMs. We accelerate HOMS-TC on NVIDIA’s tensor core units, which is emerging and readily available in the recent graphics processing unit (GPU). Our evaluation shows that HOMS-TC is 31× faster on average than alternative search engines and provides comparable accuracy to competing search tools. Availability and implementationHOMS-TC is freely available under the Apache 2.0 license as an open-source software project at https://github.com/tycheyoung/homs-tc.
more » « less
NiemaGraphGen: A memory-efficient global-scale contact network simulation toolkit

https://doi.org/10.46471/gigabyte.37

Moshiri, Niema (January 2022, Gigabyte)

Full Text Available
An Evaluation of Phylogenetic Workflows in Viral Molecular Epidemiology

https://doi.org/10.3390/v14040774

Young, Colin; Meng, Sarah; Moshiri, Niema (April 2022, Viruses)

The use of viral sequence data to inform public health intervention has become increasingly common in the realm of epidemiology. Such methods typically utilize multiple sequence alignments and phylogenies estimated from the sequence data. Like all estimation techniques, they are error prone, yet the impacts of such imperfections on downstream epidemiological inferences are poorly understood. To address this, we executed multiple commonly used viral phylogenetic analysis workflows on simulated viral sequence data, modeling Human Immunodeficiency Virus (HIV), Hepatitis C Virus (HCV), and Ebolavirus, and we computed multiple methods of accuracy, motivated by transmission-clustering techniques. For multiple sequence alignment, MAFFT consistently outperformed MUSCLE and Clustal Omega, in both accuracy and runtime. For phylogenetic inference, FastTree 2, IQ-TREE, RAxML-NG, and PhyML had similar topological accuracies, but branch lengths and pairwise distances were consistently most accurate in phylogenies inferred by RAxML-NG. However, FastTree 2 was the fastest, by orders of magnitude, and when the other tools were used to optimize branch lengths along a fixed FastTree 2 topology, the resulting phylogenies had accuracies that were indistinguishable from their original counterparts, but with a fraction of the runtime.
more » « less
Full Text Available
SALIENT: Ultra-Fast FPGA-based Short Read Alignment

https://doi.org/10.1109/ICFPT56656.2022.9974548

Khaleghi, Behnam; Zhang, Tianqi; Martino, Cameron; Armstrong, George; Akel, Ameen; Curewitz, Ken; Eno, Justin; Eilert, Sean; Knight, Rob; Moshiri, Niema; et al (December 2022, 2022 International Conference on Field-Programmable Technology (ICFPT))

Full Text Available
Accelerators for Classical Molecular Dynamics Simulations of Biomolecules

https://doi.org/10.1021/acs.jctc.1c01214

Jones, Derek; Allen, Jonathan E.; Yang, Yue; Drew Bennett, William F.; Gokhale, Maya; Moshiri, Niema; Rosing, Tajana S. (July 2022, Journal of Chemical Theory and Computation)

Full Text Available
The ViReflow pipeline enables user friendly large scale viral consensus genome reconstruction

https://doi.org/10.1038/s41598-022-09035-w

Moshiri, Niema; Fisch, Kathleen M.; Birmingham, Amanda; DeHoff, Peter; Yeo, Gene W.; Jepsen, Kristen; Laurent, Louise C.; Knight, Rob (March 2022, Scientific Reports)

Abstract Throughout the COVID-19 pandemic, massive sequencing and data sharing efforts enabled the real-time surveillance of novel SARS-CoV-2 strains throughout the world, the results of which provided public health officials with actionable information to prevent the spread of the virus. However, with great sequencing comes great computation, and while cloud computing platforms bring high-performance computing directly into the hands of all who seek it, optimal design and configuration of a cloud compute cluster requires significant system administration expertise. We developed ViReflow, a user-friendly viral consensus sequence reconstruction pipeline enabling rapid analysis of viral sequence datasets leveraging Amazon Web Services (AWS) cloud compute resources and the Reflow system. ViReflow was developed specifically in response to the COVID-19 pandemic, but it is general to any viral pathogen. Importantly, when utilized with sufficient compute resources, ViReflow can trim, map, call variants, and call consensus sequences from amplicon sequence data from 1000 SARS-CoV-2 samples at 1000X depth in < 10 min, with no user intervention. ViReflow’s simplicity, flexibility, and scalability make it an ideal tool for viral molecular epidemiological efforts.
more » « less
Ultra Efficient Acceleration for De Novo Genome Assembly via Near-Memory Computing

https://doi.org/10.1109/PACT52795.2021.00022

Zhou, Minxuan; Wu, Lingxi; Li, Muzhou; Moshiri, Niema; Skadron, Kevin; Rosing, Tajana (September 2021, 2021 30th International Conference on Parallel Architectures and Compilation Techniques (PACT))

Full Text Available
Ten simple rules for attending your first conference

https://doi.org/10.1371/journal.pcbi.1009133

Leininger, Elizabeth; Shaw, Kelly; Moshiri, Niema; Neiles, Kelly; Onsongo, Getiria; Ritz, Anna (July 2021, PLOS Computational Biology)
Schwartz, Russell (Ed.)
Full Text Available

« Prev Next »

Search for: All records